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ABSTRACT 

Measurement of the mutual intelligibility of dialects 
of a language is discussed. The focus is on several theoretical 
constructs in measurement, illustrated with data from an earlier 
study of the mutual intelligibility of 17 Chinese dialects. 
Measurement procedures are also explained. It is proposed that mutual 
intelligibility is based on the ability to categorize things and 
recognize patterns, and that phonological correspondences between 
source and target dialects fall into two categories: common 
(communication signals) and less common (noise interference). 
Further, these patterns of correspondence (signal and noise) can be 
weighted according to the characteristics of their occurrence. 
Computation of indexes of mutual intelligibility using these 
weightings, made at the level of the syllable-word in Chinese, is 
explained. Appended materials include: calculations for the mutual 
intelligibility of initial consonants, vowels, endings, and tones of 
two Chinese dialects, those of Beijing and Jinan; a chart 
illustrating degree of dialect affinity of 17 Chinese dialects based 
on mutual intelligibility; a chart illustrating dialect affinity 
based on genetics relations of initials, finals, and tones. Contains 
29 references. (MSE) 
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ABSTRACT 



It is assumed that to a large extent human pattern cognition is 
based on the observation of repeated phenomena. Repetition lends its 
weight of numbers to pattern formation. For a speaker of one dialect 
to understand the speech of another, linguistic correspondence patterns 
for the two dialects have to be established. By quantifying the 
patterns, one can speak of the degree of mutual intelligibility. Much 
of the space of this paper is devoted to discussions of the view point of 
correspondence patterns, the concepts of source dialect, target dialect, 
signal, and noise, and the weight scale proposed to quantify the level 
of signal and noise of correspondence patterns. As an example of the 
intelligibility computation based on phonology, over 2,700 syllable- 
words in the Hanyu Fangyin Zihui (Beijing University 1962, 1989) 
were used to calculate the intelligibility degrees for all the pairs of the 
17 dialects contained therein. The degrees of intelligibility for all the 
dialect pairs are used in cluster analysis to establish a grouping of 
dialects. 



LANGUAGE LEARNING LABORATORY 
College of Liberal Arts and Sciences 
University of Illinois at Urbana-Champaign 



Technical Report No. LLL-T-21-93 



QUANTIFYING DIALECT MUTUAL INTELLIGIBILITY 



Chin-Chuan Cheng 

Jubilee Professor of Liberal Arts and Sciences 
Professor of Linguistics, of Chinese, and of 

English as an International Language 
Director of Language Learning Laboratory 
University of Illinois at Urbana-Champaign 



May 1993 



Language Learning Laboratory, University of Illinois at Urbana-Champaign 
G70 Foreign Languages Building, 707 South Mathews, Urbana, Illinois 61801 



2 



As students of language we were often asked the question "bow different are Chinese dialects?" 
In the past, our standard answer to questions of this sort was to give a classification of the dialects. For 
example, we would say that Beijing belongs to Mandarin and Xiamen belongs to Southern Min. To a 
layman, that answer did not make much sense. So the follow-up question was usually the following: "Are 
they mutually intelligible?" At that point, we were as lost as the layman. How could we talk about dialect 
similarity and mutual intelligibility? 

Traditionally, dialects are classified in terms of qualitative differences of language 
characteristics. The characteristics may be represented by isoglosses - lines connecting places that have 
identical entities, They may be stated as rules, either synchronic or diachronic. They may be summarized 
as presence or absence of certain features. For example, from the point of view of the existence of voiced 
obstruent initials, we may put Suzhou of Jiangsu and Shuangfeng of Hunan in one group and other 
varieties of Chinese in another. If we focus on the presence or absence of the consonantal ending of the 
syllable, then Shuangfeng and Beijing can be. grouped together and Suzhou has to be put in a different 
class with some other dialects. But then, how do we arrive at an overall classification of Chinese dialects? 
We often rely on the relative importance of features. So we claim that some features are more important 
than others, and use the important ones to establish a grouping. How do we determine the important 
weight of features? Decisions can be made on the basis of personal qualitative judgments, which can be 
subjective. If we wish to achieve the objective of expressing things in terms of verifiable measurements, 
then we need to quantify the weight of the characteristics. Over the past 70 years the Chinese linguistics 
community has cumulated sufficient dialect data to allow for measurements of linguistic matters. As the 
large amount of data is becoming information overload, we need to make some meaningful synthesis. 
Indeed, as Wang (1991:3) puts it, "the real intellectual rewards come from the theoretical interpretations 
of these data." In the past few years, quantitative studies have contributed to the discussion of 
measurement methodologies and brought to light several new issues in linguistics. 

A very brief review of the methodologies and issues may be in order. Measurements of linguistic 
similarities existed long ago. Kroeber and Chretien (1937) and Reed and Spicer (1952) used correlation 
methods to quantify language distance. In the 1950's glottochronological studies focused on the particular 
issue of time-depths of languages. Wang (1960) was the first attempt to apply such a lexicostatistical 
method to determine linguistic relationships among five Chinese dialects. However, only in the late 
1970's and early 1980's more systematic investigations of dialect affinity using modern computer 
technology with a vast amount of data began to appear. My initial formulations of affinity inquiry (Cheng 
1982, 1986, 1987, 1988, 1991) have been improved by several in-depth studies. Ogura (1990) and Wang 
(1987) also use similarity measurements of some sorts to compute numerical indices of language affinity. 
Essentially, the affinity measurements are based on the ratio of shared items to the total number of items 
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considered. The items may be lexical entries, phonological elements, or other entities, but they all exist in 
morphemes or words. Lu and Cheng (1985) further state that words are carriers of sound change and that 
a large quantity of them used in deriving the numerical indices of dialect correlation will account for the 
weight or importance of features. However, as each investigation has its own focus, selection of 
characteristics becomes restricted and the weight problem is further complicated. For example, Wang and 
Shen (1992) state simply that the first step in determining numerical taxonomy is selection of 
characteristics. The statistical question of sample versus population needs more discussion than has been 
presented so far. On the other hand, the Pearson correlation method that I originally used has been shown 
to have a tendency to inflate the weights of the features that are absent in the pair of dialects in question 
and are included to account for other pairs. Ma (1989), Tu and Cheng (1991), and Wang and Shen 
(1992) have all discussed this issue and have come to favor the use of Jaccard's coefficient, which excludes 
the non-occurring features in the computation. Another area of concern is the tabulation of words without 
regard to their sharing of internal structures or morphemes. In Cheng (1986, 1991) I mention the need 
for a finer treatment of "word family". Wang and Shen (1992) has presented a way to tabulate all the 
morphemes within words in their quantification process and thus give more appropriate weights to such 
elements as prefixes, suffixes, and reduplication. Regarding dialect studies, the geographical areas that 
have more varied dialectal differences especially need quantitative measurements to establish affinity. For 
example, Lu (1991), and Wang and Shen (1992) deal with the Wu-dialect area, and Lu (1986) and Yang 
(ms) cover certain Min dialects. Quantitative studies of affinity are gathering momentum. 

The other aspects of the questions that were often posed by the layman have to do with how we 
calculate language mutual intelligibility. As far as I know, language mutual intelligibility has not been 
well studied. In the past, the notion of mutual intelligibility was used only loosely in language 
classification. Yet it played an important role in determining how languages or dialects were related. For 
example, in field work we usually asked our subjects if they could communicate with those in their 
neighboring communities. We often used the information so obtained to make statements of dialect 
relationships. 

The affinity measurements mentioned above are similarity measurements. In my view, mutual 
intelligibility involves not only similarity of elements but also the patterns of correspondence between two 
dialects. I therefore feel that it is not entirely appropriate to interpret similarity as mutual intelligibility. 
In the early 1950's, Voegelin and Harris (1951), Hickerson, Turner, and Hickerson (1952), and Pierce 
(1952) asked their subjects to listen to recorded passages and counted the percentage of correct scores as 
the degree of mutual intelligibility. These subjective tests yield subjective results varying according to the 
linguistic experience of the subjects. I feel that a more systematic measurement is required to obtain 
mutual intelligibility. 
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A couple of years ago, I (Cheng 1990) ventured to compute mutual intelligibility indices for the 
17 dialects given in the Hanyu Fangyin Zihui (Beijing University 1962, 1989). The methodology, 
computational procedures, and results are given in Cheng (1992). The main theme of that paper was that 
mutual intelligibility could be studied in terms of language as a system (systemic mutual intelligibility) 
and in terms of individuals' ability and experience (participant mutual intelligibility). The speaker-based 
intelligibility should have its base in the systemic intelligibility. The paper dealt with systemic matters 
only. Systemic mutual intelligibility was calculated according to the correspondence patterns between the 
dialects in question. While correspondence patterns of all linguistic components should be considered to 
derive an overall picture of mutual intelligibility, as an initial step in this endeavor, the correspondence 
patterns examined in that paper were those of the initials, medials, main vowels, endings, and tones of 
Chinese syllable-words. Some correspondence patterns can enhance while others may interfere with 
communication. Therefore the notions of signal and noise woe defined so as to determine the 
contribution of the patterns to intelligibility. A weight scale was proposed to compute the values of signal 
and noise patterns. The intelligibility degree was then obtained from the sum of the values for all the 
patterns divided by the total number of elements considered. 

As much of the space of that writing is devoted to discussion of the procedures to obtain 
mathematical precision and of the results to show the usefulness of the quantitative method, many 
concepts involved in calculating mutual intelligibility need to be further discussed. This paper will 
therefore deal in greater detail with the theoretical constructs of category cognition, signal and noise, and 
weight of elements. Naturally, in the discussion the measurement procedures will be reiterated. 

L Number, Category, and Cognition 

The basis of my analysis of mutual intelligibility is our ability to categorize things around us. As 
we see events happening with repetition, we find patterns of occurrence. The patterns then help us put 
things or events together or in separate entities and thus allow us to form categories. Categories in turn 
are the units that help shape human cognition. Dialect communication, in my view, follows the same 
cognition process. Let us take Beijing and Jinan as examples to illustrate the point. 

Let us first examine the pronunciations (tones being omitted) of the following cognate words in 
Beijing (given first) and Jinan, : 



a. 


nye 


ye 


'to abuse' 


b. 


nen 


lue 


'delicate' 


c. 


na 


na 


'to receive' 
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d. ni jii 'mud' 

e. niau suei 'urine' 

For now, let us look at the initials. We see that the dental nasal in Beijing corresponds to the zero initial, 
the lateral, the dental nasal, the palatal nasal, and the dental fricative in Jinan. From these data we do not 
see any patterns. If they were the only words in the Beijing dental nasal category, then we would have to 
memorize the one-to-one sound correspondence individually. However, of the more than 2,700 worts in 
the Hanyu Fangyin Zihui there are 53 words pronounced with the dental nasal in Beijing. The 
occurrences of their cognate counterparts in Jinan show the following patterns, the number indicating the 
amount of words occurring in each correspondence: 



(2) 



a. 


n 


zero 


3 


b. 


n 


1 


1 


c. 


n 


n 


27 


d. 


n 


P 


21 


e. 


n 


s 


1 



■ As the initials of the 53 words fall into five categories, we have a fuller picture of the correspondence of 
sounds in these two dialects. Of these five patterns we find two types in terms of occurrence: the minor 

■ type of the patterns for 1, 1, and 3 words and the major type of the patterns for 27 and 21 words. The 

■ meanings of majority and minority here are straightforward. Occurrences with a small number of words 
are of the minor type, and occurrences with a large number of words are of the major type. We can better 

| define these terms by using the mean as the reference point and say that minority is the occurrence smaller 

than the mean and majority is the occurrence equal to or greater than the mean. This definition is again 

I straightforward, and it seems reasonable since we often deal with matters using the mean as the reference 

point The implication here is that the mean is considered to represent a psychological level at which 

I point certain aspects of human cognition change more markedly. 

■ In (2) we have a total of 53 words occurring in five patterns. The mean is 10.6. Now we can 

■ restate the two types of correspondence patterns by saying that those whose occurrences are smaller than 
the mean of 10.6 are the minority and those whose occurrences are greater than 10.6 are the majority. A 

| major correspondence covers relalively many words and ,v ;ure provides the speaker of one dialect 

much information about the corresponding sound in the other dialect in question. Once such a knowledge 

■ is gained, dialect communication is possible. On the other hand, since minority patterns contain a small 
number of words, one needs to spend extra efforts in learning the individual entities which do not behave 

I 0 

P d 
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as expected. Therefore we consider majority patterns as communication signal enhancement and minority 
patterns as noise interference. 

The correspondence patterns in (2) now take on an additional attribute of signal or noise 
depending on whether their occurrence is smaller or greater than the mean as given in (3): 



a. 


n 


zero 


3 


10.6 


noise 


b. 


n 


1 


1 


10.6 


noise 


c. 


n 


n 


27 


10.6 


signal 


d. 


n 


P 


21 


10.6 


signal 


e. 


n 


s 


1 


10.6 


noise 



Now we have three noise and two signal correspondence patterns regarding the dental nasal of Beijing 
with respect to Jinan. As signal and noise give different effects to dialect communication, we can use 
them to establish a weight scale to calculate mutual intelligibility. However, at this juncture we need to 
review another aspect of the correspondence patterns given above and discuss the notions of "source 
dialect" and "target dialect". 

2. Source and Target Dialects 

The patterns in (3) are formed on the basis of the dental nasal initial of Beijing. When the dental 
nasal initial of Jinan is used as the basis for its correspondence to Beijing pronunciations, the patterns will 
be different. In fact there is only one pattern as given in (4): 

(4) n n 27 27.0 signal 

All the words with a dental nasal initial in Jinan are pronounced with a dental nasal initial in Beijing. 
The number of words involved is 27, and the mean is 27.0. Therefore this pattern is considered signal. 

As we see the difference between the patterns listed in (3) and that in (4), we recognize the 
unidirectional nature of the correspondence patterns. We therefore need to state the basis of the 
correspondence. In another word, the patterns in (3) are those from the point of view of the Beijing dental 
nasal and that in (4) is the pattern from the point of view of the Jinan dental nasal. Thus we may say that 
Beijing is the "subject" or "source" dialect and Jinan the "object" or "target" dialect in (3). On the other 
hand, in (4) Jinan is the source dialect and Beijing the target dialect. Thus as we build the patterns into 
the formula for calculating intelligibility, we will also need to recognize the directionality of intelligibility. 
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Mutual intelligibility of a pair of dialects will be computed from the two numbers of unidirectional 
intelligibility of the dialects. This will be further discussed later. 

The terms "source" and "target" here refer to how the patterns of correspondence are arranged 
and not necessarily to the view point of the speaker in actual communication. At this point we are 
arguing for the recognition of unidirectional intelligibility. That is, the correspondence patterns have to 
be considered both from the elements in one dialect and from those in the other. I realize that although 
the concepts are simple, these terms are not the most descriptive. In Cheng (1990, 1992) the wards 
"subject" and "object" were used for these concepts; here we have chosen "source" and "target" as a 
wording alternative. As we study intelligibility more in the future we may be able to refine the concepts 
and find better words for them. 

3. Weight Scale 

So far we have shown that our attempt to quantify mutual intelligibility differs considerably from 
measurements of language similarity as discussed at the beginning of this writing. We have also departed 
from the measurements of understood points in subjective tests as first carried out by Voegelin and Harris 
(1951), Hickerson, Turner, and Hickerson (1952), and Pierce (1952) mentioned earlier. The thrust of this 
paper is that similar elements in a pair of languages can carry different weights in communication. The 
weights are determined on the basis of whether a pattern is considered signal or noise. 

We assume that signal patterns enhance communication and noise patterns reduce intelligibility. 
To quantify the differences of these two types of patterns, we may assign tentatively the unit value of 1 to 
signal and -1 to noise as a first approximation. To calculate the unidirectional intelligibility of dialects A 
and B with dialect A as the source dialect, we first tabulate all the correspondence patterns from the 
viewpoint of the elements in. dialect A. Then for each pattern we calculate the value by multiplying the 
number of elements occurring in the pattern by the unit value 1 (if signal) or -1 (if noise). The sum of the 
values thus obtained divided by the total number of the elements considered is the unidirectional 
intelligibility. 

However, in many cases the uniform value of noise does not seem to capture how we feel about 
the extent of interference or confusion. For example, in (3) we see that the dental nasal of some words in 
Beijing are pronounced with the zero initial in Jinan. And Beijing has other, non-cognate, words with the 
zero initial. Thus the zero initial of the words in Jinan could be incorrectly interpreted as belonging to the 
Beijing zero-initial word class. Similarly, the Jinan lateral in (3) could be incorrectly interpreted as the 
lateral in Beijing. With regard to signal, some differentiation is also needed. For example, in (3) the 27 
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words pronounced with the dental nasal in Jinan are pronounced with the identical initial. In this case, 
there is no possibility of confusion. On the other hand, even though the pattern of the dental nasal 
corresponding to the palatal nasal is signal, the palatal nasal does not exist in Beijing. We feel that 
additional parameters are required to interpret the correspondence. Thus the patterns in (3) have to carry 
with them additional caveats as given in (5): 



a. 


n 


zero 


3 


10.6 


noise 


different 


zero occurs elsewhere in Beijing 


b. 


n 


1 


1 


10.6 


noise 


different 


1 occurs elsewhere in Beijing 


c. 


n 


n 


27 


10.6 


signal 


same 




d. 


n 




21 


10.6 


signal 


different 


ji does not occur in Beijing 


e. 


n 


s 


1 


10.6 


noise 


different 


s occurs elsewhere in Beijing 



In addition to the signal and noise types given in (5) we found others in the Beijing-Jinan 
correspondence patterns in (6), (7) and (8). The patterns in (8) are for tones transcribed in the five-point 
notation. 



(7) 



(8) 



a. 


ZL 


zero 


2 


b. 




1 


18 


c. 


ZL 


21 


25 


a. 


s 


9 


5 


b. 


s 




1 


c. 


s 


S 


65 


d. 


s 




7 


a. 


55 213 


668 


b. 


55 42 


9 


c. 


55 55 


5 


d. 


55 21 


4 



15.0 


noise 


different 


15.0 


signal 


different 


15.0 


signal 


same 


19.5 


noise 


different 


19.5 


noise 


different 


19.5 


signal 


same 


19.5 


noise 


different 



171.5 


signal 


different 


171.5 


noise 


different 


171.5 


noise 


same 


171.5 


noise 


different 



zero occurs elsewhere in Beijing 
1 occurs elsewhere in Beijing 

$ occurs elsewhere in Beijing 
ji does not occur in Beijing 

g occurs elsewhere in Beijing 

213 does not occur in Beijing 
42 does not occur in Beijing 

21 does not occur in Beijing 



The occurrence characteristics of the target-dialect elements in the signal and noise patterns can 
now be summarized as follows: 



(9) a. Elements the same as those in the source dialect 

b. Elements different from those in the source dialect: 
i. Elements not occurring in the source dialect 
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ii. Elements occurring elsewhere in the source dialect 

Again, we assume that signal enhances intelligibility and noise reduces it. The occurence 
characteristics can be used to fine-tune the value. Let us consider signal patterns first. If both dialects in 
question have the same elements for the items compared, then the intelligibility should be the highest. 
When the elements are different and the target-dialect elements also appear in other items of the source 
dialect, it is possible to confuse the items. But as signal such a pattern is a major one, and the large 
number of items involved could give a distinct impression that could more than offset confusion. For 
example, the lateral of Jinan corresponding to the voiced retroflex fricative in Beijing with 18 items as 
shown in (6) is somewhat salient even though Beijing has the lateral occurring in other words. When a 
signal pattern of correspondence between an element of the source dialect and a different element of the 
target dialect which does not occur in the source dialec;, the potential confusion is not as serious as when 
the target-dialect element also occurs elsewhere in the source dialect. For example the Beijing 55 tone 
corresponding to Jinan 213 tone in a large number of words can give Beijing speakers a distinct 
impression and therefore can allow for a consistent interpretation of tone 213 as 55. Thus the relationship 
for signal patterns in terms of the amount of communication enhancement of their corresponding elements 
can be given as follows: 

(10) Same > Different and not occurring in the source dialect > Different and occurring 
elsewhere in the source dialect 

Noise patterns are intelligibility reducers since they involve a small number of individual items. 
One may think of these patterns as idiosyncrasies that require extra efforts of attention to individual items. 
To what extent they reduce intelligibility is also dependent on the occurrence characteristics of their 
elements. A noise pattern which has different corresponding elements in the dialects and whose element 
in the target dialect also occurs in other non-cognate items of the source dialect may cause much 
confusion. In my view, that is the worst case for communication. When a noise pattern has the same 
element in both dialects, then the level of confusion is the least. Since noise reduces intelligibility, the 
noise pattern with the same corresponding element should be assigned the smallest negative value. 
Therefore the relationship for signal patterns as given in (10) holds for noise patterns. The only 
difference is that noise patterns take on negative values. If we state the relationship of the corresponding 
elements according to severity of interference, then the direction of the greater signs in (10) can be 
reversed as given in (1 1) to show an ascending order of confusion: 

(11) Same < Different and not occurring in the source dialect < Different and occurring 
elsewhere in the source dialect 



12 



10 



Now that we have the relationship, we will propose the weight scale. If we use a unitary value of 
1 for each item, then the following assignments of weight can be made: 



(12) Signal Noise 
For each item in a pattern, the target-dialect 

a. element is the same as that of the source dialect: 1.00 -0.25 

b. element is different from that of the source dialect 

i. and does not occur in the source dialect: 0.50 -0.50 

ii. and occurs elsewhere in the source dialect: 0.25 -1.00 



First, since signal enhances intelligibility, its values are positive, and since noise reduces intelligibility, its 
values are negative. Second, the highest absolute value is 1, and the absolute values of the other two 
occurrence characteristics are each reduced by half of its precedence. 



This weight scale provides a straightforward numerical interpretation of the occurrence 
characteristics of elements in correspondence patterns. However, a series of questions of theoretical 
import can be raised. For example, earlier in showing the Beijing- Jinan correspondence of phonological 
elements, we assumed that the pattern of /n/ corresponding to HI had the same weight value as the pattern 
of /n/ corresponding to the zero initial. How do we know if this assumption is valid or not? One way to 
resolve this issue is to study what can be called perceptual distance of phonological segments. We will be 
examining the theoretical bases and the implications of our analysis of intelligibility in the near future. 



The examples of correspondence patterns shown earlier are of phonological nature. However, it 
is hoped that the proposed procedures and the weight scale for deriving language intelligibility can be 
used to quantify various types of linguistic components such as phonology, morphology, syntax, etc. Each 
component will have its own items and elements to deal with. We have not worked on all these areas, but 
an attempt to quantify Chinese dialect mutual intelligibility in terms of phonological elements in cognate 
words has produced interesting results. In the following we will recapitulate the work done in Cheng 
(1990, 1992) to illustrate the calculation of dialect mutual intelligibility. 



4. Syllable-based Mutual Intelligibility 



Generally speaking, Chinese words are coterminous with the syllable. To study Chinese dialect 
mut'^al intelligibility, the best starting point is the syllable-word. Words are shared as cognates by the 
dialects, and the elements of the syllable can be used to establish correspondence patterns. Because the 
entire syllable as a unit is too coarse to show the fine details of dialectal differences, we divide the syllable 
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into the five traditional elements of initial, medial, vowel, ending, and tone for measurement. If we give a 
syllable a value of 1 , then each of these elements will have the value of 0.20. This assumes that the initial, 
medial, vowel, ending, and tone have the same weight. It may turn out that the vowel of the syllable is 
weightier in determining perception. But lacking such information we have to use an equal-weight 
valuation for now. According to the general weight scale proposed in (12) the weight scale for each of 
these phonological elements can be determined as follows: 

(13) Signal Noise 

For each item in a pattern, the target-dialect's phonological 

a. element is the same as that of the source dialect: 0.20 -0.05 

b. element is different from that of the source dialect 

i. and does not occur in the source dialect: 0.10 -0.10 

ii. and occurs elsewhere in the source dialect: 0.05 -0.20 

Each value is scaled down to one-fifth from that proposed in (12) to reflect the fact that five elements 
make up one syllable-word. 

We extracted the five elements from the computerized version of the Hanyu Fangyin Zihui 
(Beijing University 1962, 1989) and formed correspondence patterns for all the pairs of the 17 dialects, 
each once as the source dialect and once as the target dialect, thus making 272 pairs. Below we will use 
the Beijing -Jinan pair with Beijing as the source dialect to show how the unidirectional intelligibility is 
calculated. The correspondence patterns for the initials, medials, vowels, endings, and tones are listed 
with the number of items (frequency) and mean in Table 1. As the mean determines whether a pattern is 
signal or noise, the weight according to (13) is also given. The frequency multiplied by the weight is the 
value of the pattern. And as we continue to list the patterns the values are summed and shown. For each 
pattern an example in Chinese character is given as evidence of the correspondence. In the Table the 
initials are ordered in terms of the alphabetic sequence of the letters representing the IPA. The initials are 
followed by the medials /i/, /u/, and lyl (high-front rounded). Then the vowel and tone elements follow. 
The arrangements of these elements in the Table have no particular significance for the calculation 
process. Only the final sum is used to derive the unidirectional intelligibility. 

To understand how the values are calculated, let us look at the first two lines of the Table. These 
two lines show the Beijing-Jinan correspondence patterns with respect to the zero initial in Beijing. They 
show that the Beijing zero initial corresponds to the Jinan zero initial with 290 words and to the Jinan 
velar nasal with 30 words. The mean of these two patterns is 160.00. Since 290 is greater than 160, the 
first pattern is considered signal. Since the corresponding elements in the first pattern are the same zero 
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initial, the weight according to (13) is .20. The value of 58.00 is derived from 290 multiplied by the 
weight of .20. The sum at this point is 58.00. The frequency of the second pattern of the zero initial 
corresponding to the velar nasal is smaller than the mean. The pattern is therefore considered noise. As 
the velar nasal initial does not occur in Beijing, the weight is given as -.10. The frequency (30) is 
multiplied by the weight to yield the value of -3.00. The sum at this point is 55.00 (58.00 - 3.00). 

At the end of Table 1 we obtain the sum of 2,004.25. There are 2,763 syllable- words considered. 
And the unidirectional intelligibility with Beijing as the source dialect is the sum divided by the number 
of syllable-words tabulated, i.e. 2004.25 / 2763 a 0.725. We can tabulate the correspondence patterns for 
Jinan and Beijing with Jinan as the source dialect in the same fashion. Indeed, we calculated all the pairs 
of the 17 dialects. The unidirectional intelligibility with Jinan as the source dialect is 0.713. We then 
calculate the mutual intelligibility by taking the mean of the unidirectional intelligibility values. Thus the 
Beijing-Jinan mutual intelligibility is (0.725 + 0.713) / 2 = 0.719. 

All the 272 (17 x 16) pairs of the 17 dialects have been so computed and the values are given in 
Table 2. Each pair of the dialects in Table 2 has three numbers. The first one is the unidirectional 
intelligibility of the pair with the dialect given in the column as the source dialect. The second number is 
the unidirectional intelligibility of the pair with the dialect given in the row as the source dialect. The 
third number is the mean of these two numbers and is the mutual intelligibility of the pair. 

By definition, the values of intelligibility can vary from -1 to L In these Chinese dialects, the 
unidirectional intelligibility values so calculated vary from the lowest .341 for the Guangzhou-Shuangfeng 
pair and the Wenzhou-Xiamen pair to the highest .799 for the Chengdu-Hankou pair. We calculate the 
mutual intelligibility by taking the mean of the two unidirectional intelligibility values of a pair of the 
dialects. We hope that this mean would not obliterate large differences between the two unidirectional 
intelligibility values of a pair. For example, as shown in Table 2 the Beijing- Jinan value is .725 and the 
Jinan-Beijing value is .713; the difference is .012. Indeed, as we examine all the pairs of the dialects the 
differences between the two unidirectional intelligibility values are all very small, ranging from 0 for the 
pair of Shuangfeng and Chengdu to .114 for the pair of Xiamen and Wenzhou. Empirically, the mean 
does not give a number drastically different from those of unidirectional intelligibility. 

The values show different DEGREES of mutual intelligibility. However, one might wish to know 
if the value of .500 would mean exactly that hearers can understand 50% of what is said. A similar 
question would be at what point of the scale speakers of a pair of languages cannot communicate with one 
another. These ax empirical questions and require empirical studies to find the answers. In actual 
speech communication, the linguistic background, experience, ability, and knowledge of the 
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communication participants all contribute to mutual understanding. This is why I have proposed to make 
a distinction between systemic mutual intelligibility, which describes a pair of languages in terms of their 
systems and correspondence patterns, and participant mutual intelligibility, which is based on systemic 
mutual intelligibility and which takes into consideration personal experience and linguistic ability of 
individuals (Cheng 1990, 1992). 

The intelligibility calculated in terms of the correspondence patterns of syllable-words between 
the pairs of the 17 Chinese dialects is an attempt to quantify systemic mutual intelligibility. The scope is 
limited, leaving out syntactic, semantic, discoursal, and other linguistic correspondence patterns. It is 
hoped that in the future as we know more about this field we will be able to incorporate more relevant 
elements for a fuller account of mutual intelligibility. However, in the past, studies in Chinese language 
standardization, dialect classification, dialect affinity, and historical evolution all had a sharp focus on the 
elements of the syllable-word. It is therefore appropriate to start our mutual intelligibility investigation 
with the syllable-word. Hopefully some meaningful comparisons can be made with these other types of 
studies. 

Various uses of quantified mutual intelligibility are discussed or alluded to in Cheng (1990, 
1992). In the interest of quantitative studies, here we will direct our attention to the use of the 
intelligibility indices to establish a dialect grouping. At the beginning of this writing we mentioned that 
we often relied on some impressionist view of mutual intelligibility to establish dialect subgrouping. Now 
that we have derived the intelligibility indices for the 17 dialects, we can follow some rigorous procedures 
to establish dialect classification. 

There are many ways to determine numerical taxonomy (see Ma 1989 for discussion). Here I 
will follow what I have been doing in the past few years so that a comparison of dialect affinity viewed 
from different perspectives can be made meaningfully. The subgrouping method that I employ is cluster 
analysis using average linking. This method groups the closest pairs first. Then it links the next closest 
dialects. Eventually a classification tree is established. The classification of these 17 dialects is given in 
Figure 1. I will explain the process of linking here. The first step of the process involves ranking the 
mutual intelligibility of the dialect pairs in a descending order. To illustrate this process, we extract from 
Table 2 and list the first 10 closest pairs in (14). Please note that since we are looking at the mutual 
intelligibility and not unidirectional intelligibility values the order of dialect in each pair is immaterial. 

(14) 1 .795 Hankou-Chengdu 

2 .768 Jinan-Xi'an 

3 .727 Beijing-Hankou 
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4 


.726 


Beijing-Chengdu 


5 


.719 


Beijing-Jinan 


6 


.693 


Xi'an-Chengdu 


7 


.685 


Beijing-Xi'an 


8 


.676 


Hankou-Changsha 


9 


.660 


Chengdu-Changsha 


10 


.657 


Jinan-Chengdu 



The closest pair, Chengdu and Hankou, first form a group. In Figure 1 we see that these two dialects are 
grouped together and the lines are joined at .795 on the scale. We then group the next closest pair, Jinan 
and Xi'an. In Figure 1 we see that Jinan and Xi'an are linked at .768 on the scale. The next closest pair is 
Beijing and Hankou. But Hankou has already been linked with Chengdu. It is therefore necessary to link 
Beijing with the Chengdu-Hankou pair. The average linking method obtains the average of the suai of 
the Beijing-Hankou (.727) and Beijing-Chengdu (.726) and link Beijing with the pair of Hankou and 
Chengdu at the average point ((.727 + .726) / 2 = .726). Thus we see in Figure 1 Beijing joins Hankou 
and Chengdu at .726. 

Since the fourth closest pair of Beijing and Chengdu has been treated, we next go to process the 
fifth pair given in (14). Here Beijing and Jinan are to be linked at .719. But Beijing has already been 
linked with Hankou and Chengdu, and Jinan has already been linked with Xi'an. It is necessary to link 
these five dialects together. The average linking point is the average of the values of these six pairs: 
Jinan-Beijing (.719), Jinan-Hankou (.588), Jinan-Chengdu (.657), Xi'an-Beijing (.685), Xi'an-Hankou 
(.635), Xi'an-Chengdu (.693). Thus in Figure 1 these dialects are linked at .662 ((.719 + .588 + .657 + 
.685 + .635 + .693) / 6 = .662). These paragraphs illustrate the use of average linking in cluster analysis. 
As we complete the linking of all the dialect pairs, we obtain the classification tree of Figure 1 . 

The grouping as given in Figure 1 shows a remarkable similarity to what we know about the 
genetic relationships of these Chinese dialects. Suzhou appears at the north-south dividing point Above 
it the dialects are mostly Mandarin except that Changsha is usually considered a separate Xiang dialect. 
The more diverse southern dialects come below Suzhou and are mostly less mutually intelligible. Moutain 
et al (1992:320) using a "neighbor-joining tree" linking method to group siitalar dialects according to 
lexical and phonological correlations also conclude that "Suzhou is not much more similar to the southern 
localities than it is to the northern ones." It seems that our general impressionist understanding of dialect 
mutual intelligibility is very well captured in this numerical taxonomy. 
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It is also interesting to compare this mutual intelligibility classification with the grouping 
established on the basis of genetic information. In Cheng (1986, 1988, 1991) the same Hanyu Fangyin 
Zihui (Beijing University 1962) syllable-words were tabulated for occurrences of the initials, finals, and 
tones according to historical relationships. The calculated historical correlation coefficients were then 
analyzed and linked to establish a classification. This genetic classification is reproduced in Figure 2. In 
this figure we see that Taiyuan stands out alone. This is due to the fact that in Taiyuan the Middle 
Chinese Even tone did not split into two tones as it did in all other dialects. Otherwise, the genetic 
classification is quite similar to that based on mutual intelligibility as given in Figure 1. 

5. Concluding Remarks 

The fact that the results of this quantitative study of mutual intelligibility match well our general 
impression of Chinese dialect mutual intelligibility might engender the feeling that the methodology is 
trivial. In my view, it is significant that the methodology provides a rigorous procedure for calculating 
mutual Titelligibility. It gives a measurement that is verifiable and that is not based on the investigator's 
subjective judgments. Moreover, the numerical indices resulted from this calculation give a more precise 
description of the degrees of dialect relationships. As we said earlier, one of the goals of scientific 
investigations is to achieve the objective of expressing things in terms of verifiable measurements. It is 
my hope that this study has moved linguistic inquiries closer to that goal. 

On the other hand, there are many areas that I have left untouched. I have proposed a 
measurement of mutual intelligibility that takes into consideration the patterns of dialectal correspondence 
and the weights of signal and noise in communication. The Chinese syllable-words conveniently provide 
cognates as the basis for determining correspondence patterns of the initials, medials, vowels, endings, 
and tones. The calculation of mutual intelligibility then seems straightforward and reasonable. It is not 
immediately clear how to apply the procedures that we have presented for syllable-words to other 
linguistic elements such as syntax, semantics, discourse, and pragmatics. A great deal of work on 
quantification of mutual intelligibility remains to be done. 

We mentioned earlier that Voegelin and Harris (1951), Hickerson, Turner, and Hickerson 
(1952), and Pierce (1952) asked their subjects to listen to recorded passages and counted the percentage of 
correct scores as the degree of mutual intelligibility. I know of little work done on mutual intelligibility 
during the 1960s. In the middle 1970s Voiers (1977) presented a diagnostic evaluation of speech 
intelligibility. In recent years, as evaluation of computer synthesized speech was needed for technological 
improvements, speech intelligibility tests were designed or carried out. For example, Mack, Tierncy, and 
Boyle (1990) used Diagnostic Rhyme Test, Meaningful Sentence Test, and Semantically Anomalous 



1 O 



16 



Sentences Test to compare the intelligibility of natural and computer-generated speech presented to native 
English and German speakers. All these studies cited here and those appear in Hawley (1977) arc 
experimental in nature and involve the use of subjects. And speech intelligibility was, by and large, the 
percentage of correct scores made by the human subjects. While these tests are important for evaluating 
human understanding of speech, the results vary from subject to subject and therefore do not account for 
the mutual intelligibility of languages as systems. As I stated earlier, the main interest of this writing is in 
the mutual intelligibility of languages at the systems level. I call this intelligibility systemic mutual 
intelligibility. The intelligibility calculated on the basis of the tests with speakers can be called subjective 
or participant mutual intelligibility. Since a speaker's use of a language is basically defined by the 
language as a system, I feel that systemic mutual intelligibility is the basis for establishing a fuller account 
of participant mutual intelligibility. However, a meaningful proposal for integrating subjective experience 
and ability into systemic mutual intelligibility to account for personal variation has yet to be made. 

As we study further the theoretical issues mentioned or alluded to here in this writing, we hope to 
extend linguistics beyond its current boundaries At this moment, we are not yet entirely ready to answer 
laymen's questions about mutual intelligibility. There are many language matters that we do not yet 
understand. But that is the fun of doing linguistics. 
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value sum: 2004.25 
total number of syllable-words: 2763 
intelligibility: 2004.25/2763= .725 
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Figure 1. Dialect Affinity Based on Mutual Intelligibility 
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Figure 2. Dialect Affinity Based on Genetic Relations of Initials, Finals, and Tones 
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